Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection

نویسندگان

چکیده

The effective use of multi-scale features remains an open problem for object detection tasks. Recently, proposed detectors have usually used Feature Pyramid Networks (FPN) to fuse features. Since a relatively simple feature map fusion approach, it can lead the loss or misalignment semantic information in process. Several works demonstrated that using bottom-up structure Network shorten path between lower layers and topmost feature, allowing adequate exchange from different layers. We further enhance by proposing residual aggregation (MSRA-FPN), which uses unidirectional cross-layer module aggregate multiple triangular layer. In addition, we introduce Residual Squeeze Excitation Module mitigate aliasing effects occur when are aggregated. MSRA-FPN enhances high-level maps, mitigates decay during fusion, capability model large objects. It is experimentally our improves performance three baseline models 0.5–1.9% on PASCAL VOC dataset also quite competitive with other state-of-the-art FPN methods. On MS COCO dataset, method improve 0.8% model’s 1.8%. To validate effectiveness detection, constructed Thangka Figure Dataset conducted comparative experiments. 2.9–4.7% this reach up 71.2%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MSDNN: Multi-Scale Deep Neural Network for Salient Object Detection

Salient object detection is a fundamental problem and has been received a great deal of attentions in computer vision. Recently deep learning model became a powerful tool for image feature extraction. In this paper, we propose a multi-scale deep neural network (MSDNN) for salient object detection. The proposed model first extracts global high-level features and context information over the whol...

متن کامل

Template Matching based Object Detection Using HOG Feature Pyramid

The Object detection based on histogram of oriented gradients was introduced in the context of detecting humans in images [1]. Detecting human subject in the image is a difficult task due to the fact that they appear in different appearance and poses. Thus the need to come up with a feature that allows human dorm to be detected cleanly even in cluttered background under different illumination. ...

متن کامل

Qualitative Multi-scale Feature Hierarchies for Object Tracking

This paper shows how the performance of feature trackers can be improved by building a hierarchical view-based object representation consisting of qualitative relations between image structures at different scales. The idea is to track all image features individually, and to use the qualitative feature relations for avoiding mismatches, resolving ambiguous matches and for introducing feature hy...

متن کامل

EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching

Recently convolutional neural network (CNN) promotes the development of stereo matching greatly. Especially those end-to-end stereo methods achieve best performance. However less attention is paid on encoding context information, simplifying two-stage disparity learning pipeline and improving details in disparity maps. Differently we focus on these problems. Firstly, we propose an one-stage con...

متن کامل

Multi Sensor Fusion for Object Detection Using Generalized Feature Models

This paper presents a multi sensor tracking system and introduces the use of new generalized feature models. To detect and recognize objects as selfcontained parts of the real world with two or more sensors of the same or of several types requires on the one hand fusion methods suitable for combining the data coming from the set of sensors in an optimal manner. This is realized by a sensor fusi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2022

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12010093